|
|
Accession Number |
TCMCG075C06619 |
gbkey |
CDS |
Protein Id |
XP_007043830.2 |
Location |
complement(join(12912849..12912912,12913020..12913066,12913369..12913626,12913706..12913817,12913904..12914172,12914269..12914415,12914516..12915124,12915350..12916774,12916867..12916927,12917018..12917073,12917195..12917318,12917418..12918118)) |
Gene |
LOC18608860 |
GeneID |
18608860 |
Organism |
Theobroma cacao |
|
|
Length |
1290aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007043768.2
|
Definition |
PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Theobroma cacao] |
CDS: ATGTATGTTGTTGGATCTAATTGGATCGAGTGGAGTAAATTAAAGAAAAACGAGGAAACCCTAGTTGAGATGGGAAAAGATGAAACTAAAGTGGAGGATGTTGAAGAAGGTGAAATTTCGGATTCAGCCTCAATAGAGGAGATCAGTGAGGAAGATTTTAACAAGCAAGATGTTAAGATTTTGAAGGAATCGAAGTCGTCGAAAGGAGGAGAAGCGAATTCGAATTCAAGAGTTTGGACGATGCAGGATCTGTGCAAGTACCCGTCGGTTATTCGAGGGTATGCTTCGGGTTTGTATAATTTTGCTTGGGCACAAGCAGTTCAGAATAAACCTCTGAATGAGATTTTTGTTAAGGATTTTGAACAACCACAACAAGACGAGAACAAGAACTCCAAGCGATCGTCGCCGTCGTCTTCGGTGGCTTCTGTAAATAGCAAAGAGGAGAAGGGTAGTAGTGGAAATTTAGCTGTTAAAGTTGTGATTGATGATGATAGTGAGGATGAAATGGAGGAGGACAAGGTTGTGAATCTAGACAAGGAAGAAGGGGAGTTGGAAGAGGGGGAGATTGATTTAGATTCGGAGCCTAAAGAAAAGGTTTTGAGTAGTGAGGATGGTAATGTTGGTAACTCGGATGAGCTGGAGAAGCGTGCTAACTTGATTCGGGGAGTACTGGAAGGGGTTACTGTGATTGAAGCAGAGAAATCATTCGAGGGAGTTTGTTCCAGACTGCAGAATGCTCTGGAGAGCTTGCGAGCTTTGATATTGGAATGCAGTGTTCCCGCGAAGGATGCCCTTATTCAACTGGCGTTTGGAGCAATTAATTCTGCATTTGTTGCTCTGAACTGTAATTCAAAGGAACAGAATGTGGCCATTTTATCAAGGTTACTTTCCATTGTAAAGGGTCATGATCCCTCTCTGTTCCCTCCTGACAAGATGAAAGAGATTGATGTCATGTTGATCTCTCTGAATTCCCCTGCTAGAGCAATTGATACAGAGAAGGATATGAAGGTTGTAGACGGAGTTAACAAGAAGGATCCTGATGCTTTACCTGAAAATATTTGTCATGATTTGACTGTCACAAATAAGTTGCCTTCATCTGCCAAGTTTGTAATTAATAATAAGCCAAATGCATTAACAGAAACTTTAAAGCCAGGAGTACCTAATTTTAGGAATAGAGGGATTTCACTGCCCCTGCTAGACCTTCACAAGGATCATGATGCAGACAGCCTTCCTTCACCAACGCGAGAAACAACACCATGTTTGCCTGTAAACAAGCCATTGACAAGCGGAGATGTCATGGTTAAATCAGGGTTTATGACAGGTAAAGGTTCACATGATGCAGAAGGTGATAAATTGCACCCTTATGAAACAGATGCCCTCAAAGCCTTTTCTACCTATCAACAAAAATTTGGTCAAGGTTCTTTCTTTTCAAGTGATAGACTTCCAAGCCCAACCCCTTCTGAAGAATCTGGTGATGAGGGTGGTGATAATGGTGGGGAAGTTTCTAGTTCCTCCAGCATTGGTAACTTTAAACCAAATCTGCCCATTTTGGGGCATCCAATTGTTTCTTCAGCACCTCTAGTTGATAGTGCTAGTTCCAGCTTGCAGGGACAGATTACAACTAGAAATGCGACACCAATGAGTTCTGTGTCTAATATAGTGTCGAAATCCTTAGCAAAAAGCAGAGACCCTAGGCTTTGGTTTGCCAACTCTAATGCAAGTGCTTTGGATCTCAATGAACGGCTCTTGCATAATGCATCTAAAGTGGCACCTGTTGGAGGAATAATGGATTCAAGAAAGAAAAAGAGTGTTGAAGAACCTATTTTGGATAGCCCTGCACTTAAAAGACAAAGGAATGAGTTGGAAAATTTGGGGGTTGCTAGGGATGTGCAAACTGTGTCTGGAATTGGTGGCTGGTTAGAGGACACTGATGCTATTGGGTCTCAGATAACAAACAGAAACCAAACTGCAGAGAATTTGGAATCCAATTCCAGGAAAATGGATAATGGAGTAACTAGTTCAAGTACTCTAAGTGGTAAAACTAATATCACTGTTGGTACAAATGAGCAGGTGCCAGTGACAAGTACGAGTACCCCTTCGTTACCTGCTTTGTTGAAAGATATTGCAGTGAATCCAACCATGCTGATAAACATACTTAAGATGGGACAACAGCAGAGATTAGGAGCTGAAGCCCAGCAGAAATCCCCCGATCCTGTAAAAAGTACATTTCATCAGCCAAGCTCAAATTCATTACTGGGAGTAGTTTCCTCCACAAATGTCATTCCCTCCCCTTCTGTGAACAATGTTCCTTCAATTTCGTCTGGGATTTCGTCAAAACCTGCGGGAAATCTTCAGGTTCCTTCTCCGGATGAGTCCGGAAAAATTCGCATGAAACCTCGTGACCCTCGCCGTGTTCTTCATGGAAATTCACTTCAAAGGAGTGGTAGCATGGGACCTGATCAATTAAAAACAAATGGTGCCCTTACTTCAAGCACCCAGGGAAGCAAGGATAATCTGAATGCCCAAAAGCTGGACAGTCAGACAGAATCAAAACCGATGCAATCTCAGCTTGTTCCACCACCAGATATCACTCAACAATTCACTAATAATCTTAAAAATATTGCTGGTATTGTGTCTGTGTCACAAGCATTGACTAGTCTGTCACCAGTGTCCCACAATTTAGTCCCCCAACCAGTACTAATTAAGTCTGACAGCATGGATATGAAAGCACTAGTTTCTAATTCTGAGGATCAGCAGACTGGGGCTGGTTTAGCACCTGAAGCAGGTGCAACAGGTCCTCATTCACAGAATGCATGGGGAGATGTTGAACATCTTTTCGAAAGATATGATGACCAGCAAAAGGCAGCTATCCAGAGAGAAAGGGCAAGGAGGATAGAAGAACAGAAGAAAATGTTTTCTGCACGCAAACTCTGTCTTGTTTTGGATCTAGATCATACACTTCTTAATTCAGCCAAATTTATTGAAGTAGACCCGGTGCATGAGGAGATCTTGAGAAAGAAAGAGGAACAGGATCGTGAAAAACCAGAGAGACATCTTTTCCGCTTTCATCATATGGGAATGTGGACCAAATTGCGACCTGGAATTTGGAATTTCTTAGAGAAGGCTAGTAAGTTGTATGAGCTGCATCTTTACACAATGGGGAACAAGCTATATGCCACGGAGATGGCAAAAGTGCTTGATCCAAAAGGGGTTTTGTTTGCTGGACGTGTCATTTCTAGGGGTGACGACGGAGATCCCTTTGATGGTGATGAGAGGGTTCCTCGGAGTAAGGATCTGGAAGGGGTTCTGGGTATGGAATCGGCTGTGGTTATAATTGATGATTCTGTCAGAGTCTGGCCGCATAATAAGCTTAACTTGATTGTTGTAGAGAGGTATACTTATTTCCCTTGTAGTCGACGCCAATTTGGGCTTCTAGGTCCTTCTCTTCTTGAGATTGACCATGATGAGAGACCAGAAGATGGGACGTTGGCATCTTCTTTGGCGGTTATTGAGAGAATACATCAAGATTTCTTTTCACATCAGAATTTAGATGATGTAGATGTTAGAAATATCCTAGCCTCAGAGCAACGGAAGATTTTGGCTGGTTGTCGCATAGTGTTTAGTAGGGTATTTCCTGTTGGTGAAGCCAATCCTCACCTACACCCATTGTGGCAAACAGCTGAGCAATTTGGAGCTGTGTGCACAAATCAGATAGATGAGCATGTCACACATGTGGTGGCCAACTCTCTTGGAACTGATAAGGTGAATTGGGCTCTATCTACTGGAAAATTTGTTGTCCACCCCGGCTGGGTGGAAGCATCAGCATTGCTTTATCGGAGGGCCAATGAGGTTGACTTTGCCATTAAACCATAA |
Protein: MYVVGSNWIEWSKLKKNEETLVEMGKDETKVEDVEEGEISDSASIEEISEEDFNKQDVKILKESKSSKGGEANSNSRVWTMQDLCKYPSVIRGYASGLYNFAWAQAVQNKPLNEIFVKDFEQPQQDENKNSKRSSPSSSVASVNSKEEKGSSGNLAVKVVIDDDSEDEMEEDKVVNLDKEEGELEEGEIDLDSEPKEKVLSSEDGNVGNSDELEKRANLIRGVLEGVTVIEAEKSFEGVCSRLQNALESLRALILECSVPAKDALIQLAFGAINSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLFPPDKMKEIDVMLISLNSPARAIDTEKDMKVVDGVNKKDPDALPENICHDLTVTNKLPSSAKFVINNKPNALTETLKPGVPNFRNRGISLPLLDLHKDHDADSLPSPTRETTPCLPVNKPLTSGDVMVKSGFMTGKGSHDAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRLPSPTPSEESGDEGGDNGGEVSSSSSIGNFKPNLPILGHPIVSSAPLVDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNASKVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIGSQITNRNQTAENLESNSRKMDNGVTSSSTLSGKTNITVGTNEQVPVTSTSTPSLPALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIPSPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGPDQLKTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIAGIVSVSQALTSLSPVSHNLVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPHSQNAWGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFSHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIKP |